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TCCGGGGGTG GGGTTCCCAG GTAGACCCCG GCCCCTCCCG TGAGCCCCTT TACCCAGGCC 

GCCACCTCCT CCAGGGGGGC CAAGGCGTGC AAGGAGAGGA ACGTCCGCAC C ACGCC CTAT 
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CGACCTCGGA CAAGAGACCG TGGACAACAT CCTCAAGCGC CTCCGCCGTA TTGAGGGCCA 1820 

GGTGCGGGGG CTCCAGAAGA TGGTGGCCGA GGGCCGCCCC TGCGACGAGG TCCTCACCCA 1880 

GATGACCGCC ACCAAGAAGG CCATGGAGGC GGCGGCCACC CTGATCCTCC ACGAGTTCCT 1940 

GAACGTCTGC GCCGCCGAGG TCTCCGAGGG CAAGGTGAAC CCCAAGAAGC CCGAGGAGAT 2 000 

CGCCACCATG CTGAAGAACT TCATCTA 202? 
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glu val val gly gin glu 20 
leu ala gin ala tyr leu 40 
leu leu ala met ala val 60 
cys gin ala val gin arg 80 
asn ser val glu asp val 100 
pro arg lys val phe ile 120 
leu leu lys thr leu glu 140 
pro glu arg met pro pro 160 
thr glu glu glu ile ala 180 
glu glu glu ala leu leu 200 
ser leu leu glu arg phe 220 
ala leu gly ser pro pro 240 
lys thr ala glu ala leu 260 
ser leu val ser gly leu 280 
ala gly thr pro leu pro 300 
glu ala met glu arg leu 320 
glu ala gly arg ala leu 340 
glu val gly pro lys pro 360 
pro asp leu arg glu arg 380 
phe val arg glu ala arg 400 
asp lys ala phe his tyr 420 
gin ala his phe gly val 440 
asp pro lys ala pro pro 460 
464 
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ATP site 

MS YQVLARKWRPQTFADVVGQEHVLTALANGLS LGRI HHAYL F SGTRGVGKTS I ARL LAK 
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Q.A.LQKKFS P. .T. 
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. . AMVRT 
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. TT 


-MH. .FYQ.Y. 


. IN 


KQTL. 


. . SIRKI 


V . AINRDKLPNG . I . . . E . . T . 


.TF.KII . 


--VSA.Y.RF. 


. L . 


QE. . . 


KEP 


LKAIRE . . LAQ P . . . 





Zn ++ finger 
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VH .V E.E..KA N.I E KV 
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A) Alignment of TTH1 with alphas subunits of other organisms. 



E.coli 

V.chol. 

H.inf . 

R.prow. 

H.pyl. 

S.sp. 

M.tub. 



DRYF LEL I RTGRPDEE S YLHAAVELAEARGLP W 197 
DHFYLELIRTGRADEESYLHFALDVAEQYDLPW 197 
DHFYLALSRTGRPNEERYIQAALKLAERCDLPLV 197 
DRFYFEIMRHDLPEEQFIENSYIQIASELSIPIV 195 
DDFYLEIMRHGILDQRFIDEQVIKMSLETGLKII 213 
DDYYLEIQDHGSVEDRLVNINLVKIAQELDIKIV 202 
DNYF LELMDHGLT I ERRVRDGLLEIGRALNI PPL 220 
FFIEIQNHGLSEQK 



(ID#72) 
(ID#73) 
(ID#74) 
(ID#75) 
(ID#7 6) 
(ID#77) 
(ID#78) 
(ID#61) 



B) Alignment of TTH2 with alphas subunits of other organisms. 

E.coli NKRRAKNGEPPLDIAAIPLDDKKSFDMLQRSETTAVFQLESRGMKD 618 (ID#79) 

V.chol. NPRLKKAGKPPVRIEAIPLDDARSFRNLQDAKTTAVFQLESRGMKE 618 <ID#80) 

H.inf. NVRMVREGKPRVDIAAIPLDDPESFELLKRSETTAVFQLESRGMKD 618 (ID#81) 

R.prow. CKKLLKEQGIKIDFDDMTFDDKKTYQMLCKGKGVGVFQFESIGMKD 624 (ID#82) 

H.pyl. LKIIKTQHKISVDFLSLDMDDPKVYKTIQSGDTVGIFQIES-GMFQ 648 (ID#83) 

S.sp. QERKALQIRARTGSKKLPDDVKKTHKLLEAGDLEGIFQLESQGMKQ 643 (ID#84) 

M.tub. IDNVRANRGIDLDLESVPLDDKATYELLGRGDTLGVFQLDGGPMRD 646 (ID#85) 

T.th. RVE LD YD ALT LDD (ID#60) 
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A) 

Start codon 

ATGGGCCGGGAGCTCCGCTTCGCCCACCTCCACCAGCACA 

CCCAGTTCTCCCTCCTGGACGGGGCGCCGAAGCTTTCCGA 

CCTCCTC AAGTGGGTGGAGGAGACGACCCCCGAGGACCCC 120 

GCCTTGGCCATGACCGACCACGGCAACCTCTTCGGGGCCG 

TAGAGTTCTACAAGAAGGCCGCCGAAATGGGCATCGAGCC 

CATCCTGGGTACGAGGCCTTACGTGGCGGCGGAAAGCCCG 240 

TTTGACCGCAAGCGGGGAAAGGGCCTAGACGGGGGCTACT 

TTCACCTCACCCTCCTCGCCAAGGACTTCACGGGGTACCA 

GAACCTGGTGCGCCTGGCGAGCCGGGCTTACCTGGAGGGG 36 0 

TTTTACGAAAAGCCCCGGATTGACCGGGAGATCCTGCGCG 

AGCGCCGAGGGCCTCATCGCCTCTCGGGGTGCCTCGGGGC 

GGAGATCCCCCAGTTCATCCTCCAGGACCGTCTGGACCTG 4 8 0 

GCCGAGGCCCGGCTCAACGAGGACCTCTCCATCTTCAAGG 

ACCGCTTCTTCATTCACATCCAGAACCACGGCCTCCCCGA 

GC AGAAAAAGGTC AAC G AGGTC C TC AAGGAGTTCGC CCGA 600 

AAGTACGGCCTGGGGATGGTGGCCACCAACGACGGCCATT 

ACGGGAGGAAGGAGGCCCGCAGCGCCCACGAGGTTTTCCT 

CGCCATCCAGTCCAAGAGCACCCTGGACGACCCCGGGGCC 720 

GTTGGCTTTCCCCTGCGGGAGTTCTACGTGAAGACCCCCG 

AGGAGACGTGCGGGCCGGTGTTCCCCGAGGAGGAGTGGGG 

GGACGAGCCCTTTGACAACACCGTGGAGATCGCCCGCATG 840 

TGCAACGTGGAGCTGCCCATCGGGACAAGATGGTCTACCC 

GAATCCCCCGCTTCCCCCTCCCCGAGGGACCGGGGACCGA 

GGCCAAGTACCTAATGGAGCTAACCTTCAAGGGGCCCCTC 960 

CGCCGTTACCCGGACCGAATCACCGAGGGTTTCTACCGGG 

AGGTTTTCCGCCTTTTGGGGAAGCTTCCCCCCCACGGGCA 

CGGGGAGGCCTTGGCCGAGGCCTTGGCCCAGGTGGAGCGG 1080 

GAGGCTTGGGAGAGGCTCATGAAGAGCCTCCCCCCCTTTG 

ACCGGGGTCCAAGGAGTTCCA 



MGRELRFAHLHQHTQFSLLDGAPKLSDLLKWVEETTPEDP 
ALAMTDHGNLFGAVEFYKKAAEMGIEPILGTRPYVAAESP 
FDRKRGKGLDGGYFHLTLLAKDFTGYQNLVRLASRAYLEG 12 0 
FYEKPRIDREILRERRGPHRLSGCLGAEIPQFILQDRLDL 

FFIEIQNHGLSEQK 
AEARLNEDLS I FKDRFF IHIQNHGLPEQKKVNEVLKEFAR 
KYGLGMVATNDGHYGRKEARSAHEVFLAIQSKSTLDDPGA 240 
VGFPLREFYVKTPEETCGPVFPEEEWGDEPFDNTVEIARM 
CNVELPIGTRWSTRIPRFPLPEGPGTEAKYLMELTFKGPL 
RRYPDRITEGFYREVFRLLGKLPPHGHGEALAEALAQVER 3 60 
EAWERLMKSLPPFDRGPRSS 
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Startl Start2 3'-Exo I 

T . th . VERWRTLLDGRFLLEEGVGLWEWRYPFPLEGEAVWIjDLETTGLAG LDEVIEVGLLRLEGG RRLPF 

D • rad . PWPQDWVFDLETTGFSPA SAAIVEIGAVRIVGGQIDETLKF 

Bac.sub. HGIKMIYGMEANLVDDGVPIAYNAAHRLLEEETYVVFDVETTGLSAV YDTI IELAAVKVKGGE- - 1 IDKF 

H . inf . MINPNRQIVLDTETTGMNQLGAHYEGHCI IEIGAVELINRR-YTGNNX 

E . c . MSTAITRQIVLDTETTGMNQIGAHSEGHKI IEIGAVEWNRR-LTGNNF 

H.pyl . NLEYLKACGLNFIETSENLITLKNLKTPLKDEVFSFIDLETTGSCPI KHEI LEIGAVQVKGGE- - I INRF 

3'-Exo II 

T.. th . QSLVR-PLPP AEARSWNLT GIPREALEEAPSLEEVLEKAYPLRGDATLVIHNRAPDLGFL-RPALEGLG 

D\ rad . ETLVR- PTRPDGSMLSI PWQAQRVHGISDEMVRRAPAXKDVLPDFFDFVDGSAWAHNVSFDGGFM-RAGAERLG 

Bac . sub. EAFAN- PHRP LSATIIELT GITDDMLQDAPDVVDVIRDFREWIGDDILVAHNASFDMGFL-NVAYKKLL 

ll inf . HIYIK- PDRP XDPDAIKVH GITDEMLADKPEFKEVAQDFLDYINGAELLIHNAPFDVGFM-DYEFRKLN 

EL C . HVYLK-DRLV DPEAFGVH GIAVDFLLDKPTFAEVAVEFMDYIRGAELVXHNAAFDIGFM-DYEFSLLK 

H . pyl . ETLVKVKSVP DYIAELT GITYEDTLNAPSAHEALQELRLFLGNSVFVAHNftNFDYNFLGRYFVEKLH 

kl 3 '-Exo I IIC 

th . YRLENPWDSLRLARRGLPGLRRYGLDALSEVLELPRRT — C HRALEDVERT LAWH EVYYMLT SG 

rad . LSWAPERELCTMQLSRRAF PRERTHNLTVLAERLGLEFAPGGRHRSYGDVQVTAQAYLRLLELLG ER 

Bac.sub. E— -VEKAKNPVIDTLELGRFLYPEFKNHRLNTLCKKFDIELTQ--HHRAIYDTEATAYLLLKMLKDAA EK 

W. inf . -IJWKTDDICLVTDTLQ^1ARQMYPGKKN-NLDALCDRLGIDNSKRTLHGAIJLDAEILADVYLMMTGGQTNLFDEEE 

ill e . RDIAKTNTFCKVTDSLAVARKMFPGKRN-SLDALCARYEIDNSKRTLHGALLDAQILAEVYLAMTGGQTSMAFAME 

jHupyl. CPLLNLKLCTLDLSKRAILSMRY-SLSFLKELLGFGIEV — SHRAYADALASYKLFEICLLNLP — SYIKT 



Figure 17 
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A) 

ATGGTGGAGCGGGTGGTGCGGACCCTTCTGGACGGGAGGT 4 0 
TCCTCCTGGAGGAGGGGGTGGGGCTTTGGGAGTGGCGCTA 
CCCCTTTCCCCTGGAGGGGGAGGCGGTGGTGGTCCTGGAC 120 
CTGGAGACCACGGGGCTTGCCGGCCTGGACGAGGTGATTG 
AGGTGGGCCTCCTCCGCCTGGAGGGGGGGAGGCGCCTCCC 200 
CTTCCAGAGCCTCGTCCGGCCCCTCCCGCCCGCCGAAGCC 
CGTTCGTGGAACCTCACCGGC ATCCCCCGGGAGGCCCTGG 280 
AGGAGGCCCCCTCCCTGGAGGAGGTTCTGGAGAAGGCCTA 
CCCCCTCCGCGGCGACGCCACCTTGGTGATCC ACAACGCC 360 
GCCTTTGACCTGGGCTTCCTCCGCCCGGCCTTGGAGGGCC 
TGGGCTACCGCCTGGAAAACCCCGTGGTGGACTCCCTGCG 440 
CTTGGCCAGACGGGGCTTACCAGGCCTTAGGCGCTACGGC 
CTGGACGCCCTCTCCGAGGTCCTGGAGCTTCCCCGAAGGA 520 
CCTGCCACCGGGCCCTCGAGGACGTGGAGCGCACCCTCGC 
CGTGGTGC ACGAGGTATACTATATGCTTACGTCCGGCCGT 600 
CCCCGCACGCTTTGGGAACTCGGGAGGTAG 



B) 

MVERWRTLLDGRFLLEEGVGLWEWRYPFPLEGEAVWLD 40 
LETTGLAGLDEVIEVGLLRLEGGRRLPFQSLVRPLPPAEA 
RSWNLTGIPREALEEAPSLEEVLEKAYPLRGDATLVIHNA 120 
AFDLGFLRPALEGLGYRLENPWDSLRLARRGLPGLRRYG 
LDALSEVLELPRRTCHRALEDVERTLAWHEVYYMLTSGR 200 
PRTLWELGRZ 



Figure 18 
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Alignment of dnaA genes. 



P. mar. MLEASWEK VQSSL — KQNLSK — PSYE TWIRPTEFSG- 

Syn.sp. MVSCENLWQQ ALAIL— ATQLTK PAFD TWIKASVLIS- 

B.SUt. MENILDLWNQ ALAQI — EKKLSK — PSFE TWMKSTKAHS- 

M.tub. MTDDPGSGFTTVWNA WSELNGDPKVDDGP SSDANLSAPLTPQQR AWLNLVQPLT- 

T.th. MSHEAVWQH VLEHI— RRSITE— VEFH TWFERIRPLG- 

E.coli MSLSLWQQ CLARL — QDELPA — TEFS MWIRPLQAE- ■ 

T.mar. MKER ILQEI — KTRVNR — KSWE LWFSSFDVKS- 

H.pyl. 



LJKNNYSQTIQETAE- 65 

LQKSYGPLLMEVLT- 67 

LESRYLHLIADTIY- 67 

IERHLRAPITDALS- 87 

IRRHYAGLIQEGPR- 66 

VRDKYLNNINGLLT- 64 

LEKKYYSVLSKAVK- 61 

MDTNNNIEKE ILALVKQNPKVSL— 1 EYE NYFSQLKYNPNASKS DIAFFYAPNQVLCTT ITAKYGALLKEILSQ 72 



-FKN GELTLIAPNSFSSAW I 
-LGD GVATIQVENGFVLNH I 
-LQG DTLT ITAPNEFARDW I 
-IVE GFALLSVPSSFVQNE ] 
-IRD GVLELAVPTSFALDW ] 
-LSD NTLALYAPNRFVLDW \ 
-IEG NKWFSVGNLFIKEW I 



P. mar. 
Syn.sp 
B.sut 
M.tub. 
T.th. 

E.coli 
H.pyl 



EIFG EPVTVHVK VKANAESSDEHYSSA P ITPPLEASPGSV DSSGSSLRLSK KTLPLLNLRYVFNR 

DLTG QEITVKLI TDGLEPHS— LIGQ E SSLPMETTP KNATALNGKYTFSR 

ELTG EELSIKFV I PQNQDVEDFMPKPQ VKKAVKEDTSDFPQN MLNPKYTFDT 

RRIGH-Q IQLGVRI A PPATDEADDTTVPPS ENPATTSPDTTTDND EIDDSAAARGDNQHS WPSYFTERPHNTDSA TAGVTSLNRRYTFDT 

LLGAQ-APRFELRW PGVWQEDIFQPPPS PPAQAQP EDTFKT 

SFCGADAPQL RFE VG TOPVTQTPQAAVTSN VAAPAQVAQTQPQRA APSTRSGWDNVPAPA EP TYRSNVNVKHTFDN 

LNPDYTFEN 

VKDSYTFEN 



WLG NDATFEIT YEAFEPHSSYSEPLV KKRAVLXiTP - - 

NKVG-MHLAHSVDVR IEVAPKIQINAQSNI NYKAIKTS — 



119 

176 
108 
140 



FWGPNSRMAHAAAM 
Syn.sp. FWGPTNRMAHAASL 
FVIGSGNRFAHAASL 
FVIGASNRFAHAAAL, 
SWWGPTTPWPHGGAV 
FVEGKSNQLARAAAR 
FWGPGNSFAYHAAL 
FWGSCNNTVYEIAK 



til tub 
%.th. 
iicol: 
Tlmar 
*.pyl 



AVAESPGREFNPLFI 
AVAESPGREFNPLFL 
AVAEAPAKAYNPLFI 
AIAEAPARAYNPLFI 
AVAESPGRAYNPLFI 
QVADNPGGAYNPLFL 
EVAKHPGR-YNPLFI 
KVAQSDTPPYNPVLF 



CGGVGLGKTHLMQAI 
CGGVGLGKTHLMQAI 
YGGVGLGKTHLMHAI 
WGESGLGKTHLLHAA 
YGGRGLGKTYLMHAV 
YGGTGLGKTHLLHAV 
YGGVGLGKTHLLQSI 
YGGTGLGKTHILNAI 



GHYRLEIDPGAKVSY 
AHYRLEMYPNAKVYY 
GKYVIDHNPSAKWY 
GNYAQRLFPGMRVKY 
GPLRAKRFPHMRLEY 
GNGIMARKPNAKWY 
GNYWQNEPDLRVMY 
GNHALEK — HKKWL 



VSTETFTNDLIL — A 
VSTERFTNDLIT- -A 
LSSEKFTNEFIN — S 
VSTEEFTNDFIN — S 
VSTETFTNELINRPS 
MHSERFVQDMVK- -A 
ITSEKFLNDLVD — S 
VTSEDFLTDFLK- -H 



IRQDRMQAFRDRYR- 
IRQDNMEDFRSYYR- 
IRDNKAVDFRNRYR- 
LRDDRKVAFKRSYR- 
AR-DRMTEFRERYR- 
LQNMAIEEFKRYYR- 
MKEGKLNEFREKYRK 
LDNKTMDSFKAKYR- 



217 
202 
206 



:Syn.S] 
'W. sut 
Ifltub. 
:T;.th. 
,E=. col: 
"fe mar 
Hipyl 



AADLILVDDIQFIEG 
SADFLLIDDIQFIKG 
NVDVLLIDDIQFLAG 
DVDVLLVDDIQFIEG 
SVDLLLVDDVQF I AG 
SVDALLIDDIQFFAN 
KVDILLIDDVQFLIG 
HCDFFLLDDAQFLQG 



KEYTQEEFFHTFNAL 
KEYTQEEFFHTFNSL 
KEQTQEEFFHTFNTL 
KEGIQEEFFHTFNTL 
KERTQEEFFHTFNAL 
KERSQEEFFHTFNAL 
KTGVQTELFHTFNEL 
KPKLEEEFFHTFNEL 



HDAGSQXVLASDRPP 
HEAGKQVWASDRAP 
HEESKQIVTSSDRPP 
HMANKQXVISSDRPP 
YEAHKQI ILSSDRPP 
LEGNQQI ILTSDRYP 
HDSGKQIVICSDREP 
HANSKQIVLISDRSP 



SQIPRLQERLMSRFS 
QRI PGLQDRLI SRFS 
KEI PTLEDRLRSRFE 
KQLATLEDRLRTRFE 
KDILTLEARLRSRFE 
KEINGVEDRLKSRFG 
QKLSEFQDRLVSRFQ 
KNIAGLEDRLKSRFE 



MGLIADVQAPDLETR 
MGLIADIQVPDLETR 
WGLITDITPPDLETR 
WGLITDVQPPELETR 
WGLITDNPAPDLETR 
WGLTVAIEPPELETR 
MGLVAKLEPPDEETR 
WGITAKVMP PDLETK 



MAILQKKAEHERVGL, 
MAILQKKAEYDRIRL 
IAILRKKAKAEGLDI 
IAILRKKAQMERLAV 
IAILKMNAS-SGPED 
VAILMKKADENDIRL 
KSIARKMLE IEHGEL 
LSIVKQKCQLNQITL 



292 
296 



P. mar. 
Syn.sp. 



T.mar 
H.pyl 



PRDLIQFIAGRFTSN 
PKEVI EY I ASHYTSN 
PNEUMLYIANQIDSN 
PDDVLELIASSIERN 
PEDALEYIARQVTSN 
PGEVAFF I AKRLRSN 
PEEVLNFVAENVDDN 
PEEVMEYIAQHISDN 



IRELEGALTRAIAFA 
IRELEGALIRAIAYT 
IRELEGALIRWAYS 
IRELEGALIRVTAFA 
IREWEGALMRASPFA 
VRELEGALNRVIANA 
LRRLRGAI IKLLVYK 
IRQMEGAI IKI SVNA 



SITGLPMTVDSIAPM 
SLSNVAMTVENIAPV 
SLINKDINADLAAEA 
SLNKTPIDKALAEIV 
SLNGVELTRAVAAKA 
NFTGRAITIDFVREA 
ETTGKEVDLKEAILL 
NLMNASIDLNLAKTV 



LD PNGQGVEVT 

IjN PPVEKVAAA 

UCDII-PSSKPKVIT 
LRDLI-ADANTMQIS 
LRHLR-P— RELEAD 
LRDLL-A-LQEKLVT 
LKDFIKPNRVKAMDP 
LEDL — QKDHAEGSS 



PKQVLDKVAEVFKVT 
PETI ITIVAQHYQLK 
IKEIQRWGQQFNIK 
AATIMAATAEYFDTT 
PLEI IRKAAGPVRPE 
IDNIQKTVAEYYKIK 
IDELIEIVAKVTGVP 
LENILLAVAQSLMLK 



PDEMRSASRRR - PVS 
VEELLSNSRRR-EVS 
LEDFKAKKRTK-SVA 
VEELRGPGKTR-ALA 
TPGGAHGERRKKEW 
VADLLSKRRSR- SVA 
REEILSNSRNV-KAL 
SSEIKVSSRQK-NVA 



441 
372 
404 



P. mar. 
Syn.si 
B.sut 
M.tub 
T.th. 
E.col: 
T.mar. 
H.pyl. 



QARQVGMYLMRQGTN LSLPRIGDTFGGKDH TTVMYAIEOVEKKLS S DPQIA SQVQKIRDLLQIDSR RKR 461 

LARQVGMYLMRQHTD LSLPRIGEAFGGKDH TTVMYSCDKITQLQQ K DWETS QTLTSLSHRINIAGQ APES 447 

FPRQIAMYLSREMTD SSLPKIGEEFGGRDH TTVIHAHEKISKLLA D DEQLQ QHVKEIKEQLK 446 

QSRQIAMYLCRELTD LSLPKIGQAFG-RDH TTVMYAQRKILSEMA E RREVF DHVKELTTRIRQRSK R 507 

LPRQLAMYLVRELTP ASLPE IGQLFGGRDH TTVRYAIQKVQELAG KP DREVQ GLLRTLREACTDPVD NLWITCG 446 

RPRQMAMALAKELTN HSLPEIGDAFGGRDH TTVLHACRKIEQLRE E SHDIK EDFSNLIRTLSS 467 

TARRIGMYVAKNYLK SSLRTIAEKFN-RSH PWVDSVKKVKDSLL KG NKQLK ALIDEVIGEISRRAL SG 440 

IiARKLWYFARLYTP NPTLSLAQFLDLKDH SSISKMYSGVKKMLE EEKSPFVLSLREEIK NRLNELNDKKTAFNS SE 457 



Figure 19 
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A) 

GTGTCGCACGAGGCCGTCTGGCAACACGTTCTGGAGCACA 

TCCGCCGCAGCATCACCGAGGTGGAGTTCCACACCTGGTT 

TGAAAGGATCCGCCCCTTGGGGATCCGGGACGGGGTGCTG 120 

GAGCTCGCCGTGCCCACCTCCTTTGCCCTGGACTGGATCC 

GGCGCCACTACGCCGGCCTCATCCAGGAGGGCCCTCGGCT 

CCTCGGGGCCCAGGCGCCCCGGTTTGAGCTCCGGGTGGTG 240 

CCCGGGGTCGTAGTCCAGGAGGACATCTTCCAGCCCCCGC 

CGAGCCCCCCGGCCCAAGCTCAACCCGAAGATACCTTTAA 

AACTTCGTGGTGGGGCCCAACAACTCCATGGCCCCACGGC 3 60 

GGCGCCGTGGCCGTGGCCGAGTCCCCCGGCCGGGCCTACA 

ACCCCCTCTTCATCTACGGGGGCCGTGGCCTGGGAAAGAC 

CTACCTGATGCACGCCGTGGGCCCACTCCGTGCGAAGCGC 480 

TTCCCCCACATGAGATTAGAGTACGTTTCCACGGAAACTT 

TCACCAACGAGCTCATCAACCGGCCATCCGCGAGGGACCG 

GATGACGGAGTTCCGGGAGCGGTACCGCTCCGTGGACCTC 600 

CTGCTGGTGGACGACGTCCAGTTCATCGCCGGAAAGGAGC 

GCACCCAGGAGGAGTTTTTCCACACCTTCAACGCCCTTTA 

CGAGGCCCACAAGCAGATCATCCTCTCCTCCGACCGGCCG 720 

CCCAAGGACATCCTCACCCTGGAGGCGCGCCTGCGGAGCC 

GCTTTGAGTGGGGCCTGATCACCGACAATCCAGCCCCCGA 

CCTGGAAACCCGGATCGCCATCCTGAAGATGAACGCCAGC 840 

AGCGGGCCTGAGGATCCCGAGGACGCCCTGGAGTACATCG 

CCCGGCAGGTCACCTCCAACATCCGGGAGTGGGAAGGGGC 

CCTCATGCGGGCATCGCCTTTCGCCTCCCTCAACGGCGTT 9 60 

GAGCTGACCCGCGCCGTGGCGGCCAAGGCTCTCCGACATC 

TTCGCCCCAGGGAGCTGGAGGCGGACCCCTTGGAGATCAT 

CCGCAAAGCGGCGGGACCAGTTCGGCCTGAAACCCCGGGA 1080 

GGAGCTCACGGGGAGCGCCGCAAGAAGGAGGTGGTCCTCC 

CCCGGCAGCTCGCCATGTACCTGGTGCGGGAGCTCACCCC 

GGCCTCCCTGCCCGAGATCGACCAGCTCAACGACGACCGG 1200 

GACCACACCACGGTCCTCTACGCCATCCAGAAGGTCCAGG 

AGCTCGCGGAAAGCGACCGGGAGGTGCAGGGCCTCCTCCG 

CACCCTCCGGGAGGCGTGCACATGA 



B) 

VSHEAVWQHVLEHIRRSITEVEFHTWFERIRPLGIRDGVL 

ELAVPTSFALDWIRRHYAGLIQEGPRLLGAQAPRFELRW 

PGVWQEDIFQPPPSPPAQAQPEDTFKTSWWGPTTPWPHG 120 

GAVAVAESPGRAYNPLFIYGGRGLGKTYLMHAVGPLRAKR 

FPHMRLEYVSTETFTNELINRPSARDRMTEFRERYRSVDL, 

LLVDDVQFIAGKERTQEEFFHTFNALYEAHKQIILSSDRP 240 

PKDILTLEARLRSRFEWGLITDNPAPDLETRIAILKMNAS 

SGPEDPEDALEYIARQVTSNIREWEGALMRASPFASLNGV 

ELTRAVAAKALRHLRPRELEADPLEI IRKAAGPVRPETPG 360 

GAHGERRKKEWLPRQLAMYLVRELTPASLPEIDQLNDDR 

DHTTVLYAIQKVQELAESDREVQGLLRTLREACT 



Figure 20 
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A) 



ATGAACATAACGGTTCCCAAAAAACTCCTCTCGGACCAGC 4 0 
TTTCCCTCCTGGAGCGCATCGTCCCCTCTAGAAGCGCCAA 
CCCCCTCTACACCTACCTGGGGCTTTACGCCGAGGAAGGG 12 0 
GCCTTGATCCTCTTCGGGACCAACGGGGAGGTGGACCTCG 
AGGTCCGCCTCCCCGCCGAGGCCCAAAGCCTTCCCCGGGT 200 
GCTCGTCCCCGCCCAGCCCTTCTTCCAGCTGGTGCGGAGC 
CTTCCTGGGGACCTCGTGGCCCTCGGCCTCGCCTCGGAGC 280 
CGGGCCAGGGGGGGCAGCTGGAGCTCTCCTCCGGGCGTTT 
CCGCACCCGGCTCAGCCTGGCCCCTGCCGAGGGCTACCCC 360 
GAGCTTCTGGTGCCCGAGGGGGAGGACAAGGGGGCCTTCC 
CCCTCCGGACGCGGATGCCCTCCGGGGAGCTCGTCAAGGC 440 
CTTGACCCACGTGCGCTACGCCGCGAGCAACGAGGAGTAC 
CGGGCCATCTTCCGCGGGGTGCAGCTGGAGTTCTCCCCCC 520 
AGGGCTTCCGGGCGGTGGCCTCCGACGGGTACCGCCTCGC 
CCTCTACGACCTGCCCCTGCCCCAAGGGTTCCAGGCCAAG 600 
GCCGTGGTCCCCGCCCGGAGCGTGGACGAGATGGTGCGGG 
TCCTGAAGGGGGCGGACGGGGCCGAGGCCGTCCTCGCCCT 680 
GGGCGAGGGGGTGTTGGCCCTGGCCCTCGAGGGCGGAAGC 
GGGGTCCGGATGGCCCTCCGCCTCATGGAAGGGGAGTTCC 760 
CCGACTACCAGAGGGTCATCCCCCAGGAGTTCGCCCTCAA 
GGTCCAGGTGGAGGGGGAGGCCCTCAGGGAGGCGGTGCGC 840 
CGGGTGAGCGTCCTCTCCGACCGGCAGAACCACCGGGTGG 
ACCTCCTTTTGGAGGAAGGCCGGATCCTCCTCTCCGCCGA 920 
GGGGGACTACGGCAAGGGGCAGGAGGAGGTGCCCGCCCAG 
GTGGAGGGGCCGGACATGGCCGTGGCCTAC AACGCCCGCT 1000 
ACCTCCTCGAGGCCCTCGCCCCCGTGGGGGACCGGGCCCA 
CCTGGGCATCTCCGGGCCCACGAGCCCGAGCCTCATCTGG 1080 
GGGGACGGGGAGGGGTACCGGGCGGTGGTGGTGCCCCTCA 
GGGTCTAG 112 8 



MNITVPKKLLSDQLSLLERIVPSRSANPLYTYLGLYAEEG 40 
ALILFGTNGEVDLEVRLPAEAQSLPRVLVPAQPFFQLVRS 
LPGDLVALGLASEPGQGGQLELSSGRFRTRLSLAPAEGYP 120 
ELLVPEGEDKGAFPLRTRMPSGELVKALTHVRYAASNEEY 
RAIFRGVQLEFSPQGFRAVASDGYRLALYDLPLPQGFQAK 200 
AWPARSVDEMVRVLKGADGAEAVLALGEGVLALALEGGS 
GVRMALRLMEGEFPDYQRVIPQEFALKVQVEGEALREAVR 280 
RVSVLS DRQNHRVDLLLE EGR ILL S AEGDYGKGQEEVPAQ 
VEGPDMAVAYNARYLLEALAPVGDRAHLGISGPTSPSLIW 360 
GDGEGYRAVWPLRVZ 



B) 



Figure 21 
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T . th . beta MNIWPKKIiSDQLSLIjERIVPSRSANPLYTYIj^ 

E . coli . bet MKFTVEREHU^PLQQVSGPIXXSRPTLPII^^ 

P . mirab . be MKFI IEREQLIJCPLQQVSGPLGGRPTLPII/3NLL^ 

H . inf 1 . bet MQFSISRENLIJCPLG^VCGVLSM^PNIPVIjNNV^ 

P . put . beta MHFTIQREALJJCPI^LVAGVVERRQTLPVLSNVLLWQGO^LSLTCT 

B . cap . beta MKFTIQNDILTKNLKKITRVLVKNISFPILENI^ 

T. th. beta AQSLP-RVLVPAQPFFQLVRSLPGDLVAI^lJ^EPGQGGQLELSSGRFRTRLSIJ^>AEGy 

E.coli . bet QPHEPGATTVPARKFFDICRGLP-EGAEIAVQLE GERMLVRSGRSRFSLSTLPAADF 

P . mirab . be QSHEIGATTVPARKFFDIWRGLP-EGAEISVRT if) GDRLLVRSGRSRFSLSTLPASDF 

H . inf 1 . bet SSSENGTFTIPAKKFLDICRTLS-DDSEITVTFE QDRALVQSGRSRFTLATQPAEEY 

P . put . beta EPAEPGEITVPARKLMDICKSLP-NDALIDIKVD EQKLLVKAGRS RFTLSTLPANDF 

B . cap . beta TKYIPGKTTISGRKILNICRTLS-EKSKIKMQLK NKKMYISSENSNYILSTLSADTF 

T . th . beta PELLWEGEDKGAFPLRTRMPSGELVKALTHVRYAASNEEYRAIFRGVQLEFSPQGFRAV 

E . coli . bet PNLDD — WQSEVEFTL PQAT MKRLIEATQFSMAHQDVRYYLNGMLFETEGEEIiRTV 

P . mirab . be PNLDD — WQSEVEFTL PQAT IJCRLIESTQFSMAHQDWYYIJOILFETENTELRTV 

H . inf 1 . bet PNLTD- -WQSEVDFELPQNT LRRLIEATQFSMANQDARYFLNGMKFETEGNLLRTV 

P . put . beta PTVEE — GPGSLTCNLEQSK IJ^IERTSFAMAC^DWYYIiNGMLLEVSRNTLRAV 

B . cap . beta PNHQN— FDYISKFDISSNI LiKEMIEKTEFS^KQDWYYIjNGMIXEKKDKFLRSV 

T . th . beta ASDGYRLALYDLPLPQGFQA — KAWPARSVDEMVRVIKGADGAEAVLALGEGA'IjAIAL 

E.coli . bet ATIXSHRIJWCSMPIGQSLPS-HSVrVPRKGVIELMRMLJ^-GDNPLRVQIGSNNIRAHVG 

P . mirab . be ATDGHRLAVCAMDIGQSLPG-HSVWPRKGVIEtMPXLDGSGESLI^LQIGSNNliRAHVG 

H . inf 1 . bet ATDGHRIAVCTISLEQELQN-HSVILPRK^ 

P . put . beta STDGHRLALCSMSAPIEQEDRHQVIVPRKGILEIjARLLTD-PEGMV^ 

B . cap . beta ATTCYRI^ISYTQII^KKDII^-FSIIIPNKAV^^ 

T . th . beta GGSGVRMALRLMEGEF PDYQRVI PQEFALKVQVEGEALREAVRRVSVLSDRQNHRVDLLL 

E.COli. bet DFIFTSKLVDGPJPDYRRVLPKNPDKHLEAGCDLI^QAFARAAILSNEKFRGWLYV 

P . mirab . be - --DFIFTSKLVDGRFPDYRRVLPKNPTKTVIAGCDII.KQAFSRAAILSNEKFRGVRINL 

H . inf 1 . bet NTVFTSKLIDGRFPDYRRVLPRNATKIVEG^IWE^CJKQAFARASILSNERARSVRLSL 

P . put . beta EFTFTSKIJVDGKFPDYERVIjPKGGDKLVVGDRQALKE^^SRTAILSNEKYRGIRLQL 

B . cap . beta NL IFTTQLIEGEyPDYKSVLFKEKKNPI ITNSILLKKSLLRVAILAHEKFCGIE1KI 

T . th . beta EEGRILiSAEGDYGK-GQEEVPAQVEGPDMAVAYNARYIJJEAIiAPVG-DPAHI^ISGPTS 

E . col i . bet SENQLKITANNPEQEEAEEILD\OTSGAEMEIGFWSYV^ 

P . mirab . be TNGQOCITANNPEQEEAEEIVDVQYQGEEMEIGFNVSYIjIjDVIjOT'LKCEE^TKIjLLTDAVS 

H . inf 1 . bet KFJJQLKITASOTEHEEAEEIVDVNYNGEFJjFArGFNVlYILD^ 

P . put . beta AAGQLKIQANNPEQEEAEEEI SVDYEGSSLE I GF1WS YLLDVLGVMTTEQVRLIL SDSNS 

B . cap . beta ENGKFKVLSDNQElEETAEDLFEIDYFGEXIEISINVYYLLDVINNIKSENIALiFIjNKSKS 



T.th.beta 
E.coli .bet 
P. mirab. be 
H. inf 1. bet 
P. put. beta 
B. cap. beta 



PSLIWGDG-EGYRAWVPLRVZ 
SVQIEDAASQSAAYWMPMRLZ 
SVQVENVASAAAAYWMPMRL- 
SCL IENCEDSSCEYVIMPMRL- 
SALLQEAGNDDSSYWMPMRL- 
SIQIEAENNSSNAYWMLLKR- 
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<ID#109) 
(ID#110) 
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Heat Step 
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B) MonoQ Column 

Fraction: 5 7 9 1 1131 5 1 71 9 212325 
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